better answer
- North America > United States > California > Santa Cruz County > Santa Cruz (0.04)
- Europe > Switzerland (0.04)
- Asia > Singapore (0.04)
- Asia > Indonesia > Bali (0.04)
- Research Report > Experimental Study (0.93)
- Research Report > New Finding (0.93)
- Information Technology (0.67)
- Education (0.46)
- Media (1.00)
- Leisure & Entertainment > Sports (1.00)
- Banking & Finance (1.00)
- (2 more...)
Explicit vs. Implicit Memory: Exploring Multi-hop Complex Reasoning Over Personalized Information
Zhang, Zeyu; Zhang, Yang; Tan, Haoran; Li, Rui; Chen, Xu
In large language model-based agents, memory serves as a critical capability for achieving personalization by storing and utilizing users' information. Although some previous studies have adopted memory to implement user personalization, they typically focus on preference alignment and simple question-answering. However, in the real world, complex tasks often require multi-hop reasoning over a large amount of user information, which poses significant challenges for current memory approaches. To address this limitation, we propose the multi-hop personalized reasoning task to explore how different memory mechanisms perform in multi-hop reasoning over personalized information. We explicitly define this task and construct a dataset along with a unified evaluation framework. We then implement various explicit and implicit memory methods and conduct comprehensive experiments, evaluating their performance on this task from multiple perspectives and analyzing their strengths and weaknesses. In addition, we explore hybrid approaches that combine both paradigms and propose the HybridMem method to address their limitations. We demonstrate the effectiveness of our proposed model through extensive experiments. To benefit the research community, we release this project at https://github.com/nuster1128/MPR.
- North America > United States > District of Columbia > Washington (0.05)
- Asia > China (0.04)
- North America > United States > New York > New York County > New York City (0.04)
- (2 more...)
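As a concrete reading of the explicit/implicit distinction the Explicit vs. Implicit Memory abstract above draws, the sketch below contrasts a retrieval-style explicit memory (facts stored verbatim, several retrieved per hop) with a lossy implicit memory that folds facts into one compressed state. It is a hypothetical illustration, not the paper's HybridMem implementation; all class names and the toy lexical scoring are assumptions.

```python
# Hypothetical sketch: explicit memory keeps user facts verbatim and retrieves
# several of them for a multi-hop question; implicit memory compresses facts
# into one lossy state (a stand-in for parametric memory). Not the paper's code.
from dataclasses import dataclass, field


@dataclass
class ExplicitMemory:
    facts: list[str] = field(default_factory=list)

    def write(self, fact: str) -> None:
        self.facts.append(fact)

    def retrieve(self, query: str, k: int = 3) -> list[str]:
        # Toy lexical-overlap scoring; a real system would use embeddings.
        def overlap(fact: str) -> int:
            return len(set(query.lower().split()) & set(fact.lower().split()))
        return sorted(self.facts, key=overlap, reverse=True)[:k]


@dataclass
class ImplicitMemory:
    summary: str = ""

    def write(self, fact: str) -> None:
        # Stand-in for folding a fact into model weights or a running summary.
        self.summary = (self.summary + " " + fact).strip()


explicit, implicit = ExplicitMemory(), ImplicitMemory()
for fact in ["Alice lives in Berlin.",
             "Alice's sister is Carol.",
             "Carol works at a hospital."]:
    explicit.write(fact)
    implicit.write(fact)

# Multi-hop question: hop 1 finds the sister, hop 2 finds her workplace.
print(explicit.retrieve("Where does Alice's sister work?"))
print(implicit.summary)
```

The trade-off the paper studies is visible even in this toy: the explicit store supports targeted per-hop lookups but grows unboundedly, while the implicit state is compact but lossy.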
GRP: Goal-Reversed Prompting for Zero-Shot Evaluation with LLMs
Song, Mingyang; Zheng, Mao; Luo, Xuan
Using Large Language Models (LLMs) to evaluate and compare two answers from different models typically involves having LLM-based judges select the better answer. However, humans often approach problem-solving from the reverse perspective, for instance by choosing the worse option instead of the better one in a pairwise comparison. This kind of reverse thinking plays a crucial role in human reasoning and decision-making, and it also lets us probe the difference between original and reversed thought processes. Motivated by this, we propose a Goal-Reversed Prompting (GRP) approach for pairwise evaluation that shifts the original task from selecting the better answer to choosing the worse one, encouraging LLMs to think in reverse by prompting them to identify the worse response. Experiments on closed-source models demonstrate that GRP significantly enhances evaluation capabilities, outperforming the prompt template with the original goal.
- Asia > Middle East > UAE > Abu Dhabi Emirate > Abu Dhabi (0.14)
- Europe > Spain (0.04)
- Asia > Myanmar > Tanintharyi Region > Dawei (0.04)
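The goal-reversed recipe in the GRP abstract above is easy to picture in code: ask the judge which answer is worse, then invert the verdict. The sketch below is a minimal, hypothetical rendering; the prompt wording and the `call_llm` stub are assumptions, not the authors' released templates.

```python
# Hypothetical sketch of Goal-Reversed Prompting (GRP) for pairwise judging.
# The prompt text and the call_llm stub are assumptions; the paper's exact
# templates may differ.

def call_llm(prompt: str) -> str:
    """Stand-in for a chat-completion call to a judge LLM."""
    raise NotImplementedError("wire up your LLM client here")


REVERSED_TEMPLATE = (
    "Question: {question}\n"
    "Answer A: {a}\n"
    "Answer B: {b}\n"
    "Which answer is WORSE? Reply with exactly 'A' or 'B'."
)


def judge_grp(question: str, a: str, b: str) -> str:
    """Ask for the worse answer, then invert to recover the better one."""
    worse = call_llm(REVERSED_TEMPLATE.format(question=question, a=a, b=b)).strip()
    # Real use should validate the reply; here anything other than 'A' maps to 'A'.
    return "B" if worse == "A" else "A"
```

The only change from a standard pairwise-judge prompt is the reversed goal in the template plus the final inversion, which is what makes the method cheap to try on existing evaluation pipelines.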
Right this way: Can VLMs Guide Us to See More to Answer Questions?
Liu, Li; Yang, Diji; Zhong, Sijia; Tholeti, Kalyana Suma Sree; Ding, Lei; Zhang, Yi; Gilpin, Leilani H.
In question-answering scenarios, humans can assess whether the available information is sufficient and seek additional information if necessary, rather than providing a forced answer. In contrast, Vision Language Models (VLMs) typically generate direct, one-shot responses without evaluating the sufficiency of the information. To investigate this gap, we identify a critical and challenging task in the Visual Question Answering (VQA) scenario: can VLMs indicate how to adjust an image when the visual information is insufficient to answer a question? This capability is especially valuable for assisting visually impaired individuals, who often need guidance to capture images correctly. To evaluate this capability in current VLMs, we introduce a human-labeled dataset as a benchmark for this task. Additionally, we present an automated framework that generates synthetic training data by simulating "where to know" scenarios. Our empirical results show significant performance improvements in mainstream VLMs when fine-tuned with this synthetic data. This study demonstrates the potential to narrow the gap between information assessment and acquisition in VLMs, bringing their performance closer to that of humans.
- North America > United States > California > Santa Cruz County > Santa Cruz (0.04)
- Europe > Switzerland (0.04)
- Asia > Singapore (0.04)
- Asia > Indonesia > Bali (0.04)
- Information Technology > Artificial Intelligence > Vision (1.00)
- Information Technology > Artificial Intelligence > Machine Learning (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.71)
- Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.55)
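The behavior the VQA-guidance abstract above asks of VLMs, checking sufficiency before answering and otherwise emitting guidance, can be sketched as a two-step prompting loop. The sketch below is hypothetical: the prompts and the `vlm` stub are assumptions, not the paper's benchmark or framework.

```python
# Hypothetical sketch of the two-step behavior described above: first check
# whether the image suffices, then either answer or emit camera guidance.

def vlm(image_path: str, prompt: str) -> str:
    """Stand-in for a vision-language model call."""
    raise NotImplementedError("wire up your VLM client here")


def answer_or_guide(image_path: str, question: str) -> str:
    sufficiency = vlm(
        image_path,
        f"Question: {question}\n"
        "Does this image contain enough information to answer? "
        "Reply 'YES' or 'NO'.",
    )
    if sufficiency.strip().upper().startswith("YES"):
        return vlm(image_path, f"Answer the question: {question}")
    # Insufficient: ask for a concrete adjustment instead of a forced guess.
    return vlm(
        image_path,
        f"The image is insufficient to answer: {question}\n"
        "Give one short instruction for how to re-take the photo "
        "(e.g., 'move the camera left', 'zoom in on the label').",
    )
```

For the assistive use case the paper highlights, the guidance branch is the payoff: a short, actionable re-capture instruction rather than a hallucinated answer.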
Beyond Scalar Reward Model: Learning Generative Judge from Preference Data
Ye, Ziyi; Li, Xiangsheng; Li, Qiuchi; Ai, Qingyao; Zhou, Yujia; Shen, Wei; Yan, Dong; Liu, Yiqun
Learning from preference feedback is a common practice for aligning large language models (LLMs) with human values. Conventionally, preference data is learned and encoded into a scalar reward model that connects a value head with an LLM to produce a scalar score as preference or reward. However, scalar models lack interpretability and are known to be susceptible to biases in datasets. This paper investigates leveraging the generation capability of LLMs to address both limitations in one shot. Specifically, we prompt the pre-trained LLM to generate positive and negative judgments, both supported by rationales in natural language form. The self-generated contrastive judgment pairs are used to train the generative judge with Direct Preference Optimization (DPO). This proposal of training the generative Judge using self-generated Contrastive judgments (Con-J) ensures natural interpretability due to the generated rationales accompanying the judgments, as well as high robustness against bias without the need for an additional reward head. Experimental results show that the performance of Con-J is comparable to that of a scalar reward model trained on the same collection of preference data, and demonstrate its superior interpretability and robustness in encoding human preferences.
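The Con-J data-construction step described in this abstract can be sketched concretely: for each preference example, the pre-trained LLM is prompted once for a rationale-backed judgment favoring the preferred answer and once favoring the rejected one, and the two outputs become a DPO (chosen, rejected) pair. The sketch below is hypothetical; the prompt text and the `generate` stub are assumptions, not the paper's released code.

```python
# Hypothetical sketch of building Con-J-style DPO pairs from preference data.
# Prompt wording and the generate stub are assumptions; the paper's exact
# templates may differ.

def generate(prompt: str) -> str:
    """Stand-in for sampling from the pre-trained LLM."""
    raise NotImplementedError("wire up your LLM client here")


def build_con_j_pair(question: str, preferred: str, rejected: str) -> dict:
    base = (f"Question: {question}\n"
            f"Answer 1: {preferred}\n"
            f"Answer 2: {rejected}\n")
    # Positive judgment: rationale plus verdict agreeing with the human label.
    positive = generate(base + "Explain why Answer 1 is better, "
                               "then conclude 'Answer 1 is better.'")
    # Negative judgment: rationale plus verdict contradicting the human label.
    negative = generate(base + "Explain why Answer 2 is better, "
                               "then conclude 'Answer 2 is better.'")
    # DPO treats the label-consistent judgment as 'chosen'.
    return {"prompt": base + "Which answer is better and why?",
            "chosen": positive,
            "rejected": negative}
```

Because the chosen/rejected texts are full judgments with rationales rather than scalar scores, the trained judge explains its verdicts for free, which is the interpretability claim the abstract makes.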
ChatGPT vs. Google Bard: Which gives the better answers?
Generative AI models are the hot new thing in the Big Tech world, and everyone is joining the race. The buzz really only started with OpenAI's ChatGPT chatbot, a generative AI language model that is incredibly good at predicting which words should follow one another when you feed it prompts. Google has long been working on a similar technology, dubbed LaMDA, and with ChatGPT taking the world by storm, the company found itself forced to release some version of its AI model to the world. That's how we got Bard, Google's first publicly available chat-based generative language model, with access to many parts of the internet. But is Google really at the same level as ChatGPT already?
- North America > United States > New York (0.06)
- Europe > Sweden > Skåne County > Malmö (0.05)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.80)
DeepMind's new chatbot uses Google searches plus humans to give better answers
The difference between this approach and its predecessors is that DeepMind hopes to use "dialogue in the long term for safety," says Geoffrey Irving, a safety researcher at DeepMind. "That means we don't expect that the problems that we face in these models, either misinformation or stereotypes or whatever, are obvious at first glance, and we want to talk through them in detail. And that means between machines and humans as well," he says. DeepMind's idea of using human preferences to optimize how an AI model learns is not new, says Sara Hooker, who leads Cohere for AI, a nonprofit AI research lab. "But the improvements are convincing and show clear benefits to human-guided optimization of dialogue agents in a large-language-model setting," says Hooker. Douwe Kiela, a researcher at AI startup Hugging Face, says Sparrow is "a nice next step that follows a general trend in AI, where we are more seriously trying to improve the safety aspects of large-language-model deployments."
How To Ace ML Interview Questions
Suppose you get a call from the recruiter of your dream company, where you have applied for an ML Engineer role. You set a date and start preparing with an ML study guide like this one. On the day of the interview, you answer all the questions and are confident that you will move on to the onsite stage. However, you get a call from the recruiter saying that they have decided not to go forward. It is not enough to simply answer the question; the interviewer wants to see that you have a deep understanding of the topic.